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SEQUENCE LISTING 

3 (1) GENERAL INFORMATION: 

4 (i) APPLICANT: Nico Cerletti 

5 (ii) TITLE OF INVENTION: New process for the production of 

6 biologically active protein 

7 (iii) NUMBER OF SEQUENCES: 13 

8 (iv) CORRESPONDENCE ADDRESS: 

9 (A) ADDRESSEE: Novartis Patent Department 

10 (B) STREET: 564 Morris Avenue 

11 (C) CITY: Summit 

12 (D) STATE: New Jersey 

13 (E) COUNTRY: USA 

14 (F) ZIP: 07901 

15 (V) COMPUTER READABLE FORM: 

16 (A) MEDIUM TYPE: Floppy disk 

17 (B) COMPUTER: IBM PC compatible 

18 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

19 (D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

20 (vi) CURRENT APPLICATION DATA: 

C--> 21 (A) APPLICATION NUMBER: US/09/813 , 27 IB 

C--> 22 (B) FILING DATE: 20-Mar-2001 

23 (vii) PRIOR APPLICATION DATA: 

24 (A) APPLICATION NUMBER: PCT/EP95/02719 

25 (B) FILING DATE: 12-Jul"95 

26 (A) APPLICATION NUMBER: EPO 94810439.3 

27 (B) FILING DATE: 25-Jul-94 

2 8 (viii) ATTORNEY/AGENT INFORMATION: 

29 (A) NAME: Pfeiffer, Hesna J. . 

30 (B) REGISTRATION NUMBER: 22640 

31 (C) REFERENCE/DOCKET NUMBER: 4 - 2003 9C/C1C1/USN 

32 (ix) TELECOMMUNICATION INFORMATION: 

33 (A) TELEPHONE: (908) 522-6940 

34 (B) TELEFAX: (908) 522-6955 

3 5 (2) INFORMATION FOR SEQ ID NO: 1: 

36 (i) SEQUENCE CHARACTERISTICS: 

37 (A) LENGTH: 339 base pairs 

38 (B) TYPE: nucleic acid 

39 (C) STRANDEDNESS : double 

40 (D) TOPOLOGY: linear 

41 (ii) MOLECULE TYPE: CDNA to mRNA 

42 (iii) HYPOTHETICAL: NO 

4 3 (vii) IMMEDIATE SOURCE: 

44 (B) CLONE: E. coli LC137/pPLMu . hTGF-betal (DSM 5656) 
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A <^ 

4 D 




( ix) 


FEATURE 




























4 0 






(A) 


NAME/KEY: CDS 






















A n 
4 / 






(B) 


LOCATION: 1 


. .336 




















A Q 
4 O 






.(D) 


OTHER : 


INFORMATION : /product= 


= "human 


TPV - 
±\j r 


■betal" 






A Q 
4 y 




(XX ) 


SEQUENCE DESCRIPTION: SEQ ID NO: 


1 : 














^ n 
DU 






X \3 


PAP 


APP 


AAC 


TAT 


TGC 


TTC ■ 


AGC 


x v_<_ 


APP 


CAC 


AAG 


AAC 


TPP 


TPP 


DX 




Ala 


T. on 


A ar*i 


T Vl T* 
J. 1IX 


noil 


TVT* 

i yx 


PVQ 


Phe 


Ser 


OCX 


Thr 


C~\ 11 


Lys 


Asn 


Pvc 


P VQ 


D Z 




X 








5 










1 0 










15 




^ "3 
JJ 






ppp 


PAP 


PTP 


TAC 


ATT 


GAC 


TTC 


CGC 




CAC 


PTP 


ccc 


X VJVJ 


A AP 


TPP 

X OV3 


D4 




Val 


A YTT 

c\ x y 




xicu 




He 


Asp 


Phe 


Arg 


Lys 


Asp 


Leu 


Gly 




Lys 


irp 






















25 










30 






DO 




AIL. 


CAC 


GAG 


ccc 


AAG 


GGC 


TAC 


CAT 


GCC 


AAC 


TTP 


TPP 
X 


CTC 


GGG 


ppp 


TPP 

X 


D / 




Tl Q 

i xe 


His 


Glu 


Pro 


Lys 


Gly 


Tyr 


His 


Ala 


Asn 


Phe 


P VG 


Leu 


Glv 


rlU 


PVQ 


D O 








35 










40 










45 








jy 






TAC 


ATT 


TGG 


AGC 


CTG 


GAC 


ACG 


CAG 


TAP 


APP 


A An 


PTP 


PTP 


PPP 


PTP 


D u 




Pr*o 


Tyr 


He 


Trp 


Ser 


Leu 


Asp 


Thr 


Gin 


T VT" 


Ser 




Val 


Leu 


Ala 


Leu 


D JL 






50 










55 










fin 










O Z 




TAC 


AAC 


CAG 


CAT 


AAC 


CCG 


GGC 


GCC 


TCG 


GCG 


GCG 


ppp 


TGC 


TGC 


PTP 


ppp 






Tyr 


Asn 


Gin 


His 


Asn 


Pro 


Gly 


Ala 


Ser 


Ala 


Ala 


IT X (J 


Cys 


Cys 


Val 


Dm 


04 




65 










70 










7 S 
















CAG 


GCG 


CTG 


GAG 


CCG 


CTG 


CCC 


ATC 


GTG 


TAP 


TAP 


VJ J. KD 


ccc 


ppp 


AAG 


CCC 


O 0 




Gin 


Ala 


Leu 


Glu 


Pro 


Leu 


Pro 


He 


Val 


T VT* 

xyx 


T VT* 


V d X 


Plv 


Ara 
rtxy 


T.VQ 

j-j _y o 


Pt*o 














85 










QO 














O O 




AAG 


GTG 


GAG 


CAG 


CTG 


TCC 


AAC 


ATG 


ATC 


riTP 


CCC 

^ VJ V_ 


TPP 


TPP 


AAG 


TPP 


APP 






Lys 


Val 


Glu 


Gin 


Leu 


Ser 


Asn 


Met 


He 


V el X 


A t*<t 

Arg 


Cot* 


Pir g 


j-j y o 




otiX 


/ U 










100 










105 










110 

X X. \J 






7 T 
/ X 




TGA 
































7 ^ 


/ 7 \ 


INFORMATION FOR { 


SEQ : 


ID NO: 2: 




















7 A 
1 4 




(i) 


SEQUENCE CHARACTERISTICS: 


















7 ^ 






(A) 


LENGTH 


: 112 amino acids 
















7 £ 






(B) 


TYPE: < 


amino acid 




















7 7 






(D) 


TOPOLOGY : i 


linear 




















7 Q 




(ii) 


MOLECULE TYPE: protein 




















7 Q 




(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 


: 2 : 














oil 




Ala 


Leu 


Asp 


Thr 


Asn 


Tyr 


Cys 


Phe 


Ser 


Ser 


Thr 


Pi n 


Lys 


Asn 


y o 




Q 1 




1 








5 










10 










X J 




P 7 




Val 


Arg 


Gin 


Leu 


Tyr 


He 


Asp 


Phe 


Arg 


Lys 


Asp 


T.oil 

jjcu 


Gly 


Trp 


T.VQ 


xxp 












20 










25 










30 






84 




He 


His 


Glu 


Pro 


Lys 


Gly 


Tyr 


His 


Ala 


Asn 


Phe 


Cys 


Leu 


Gly 


Pro 


Cys 


85 








35 










40 










45 








86 




Pro 


Tyr 


He 


Trp 


Ser 


Leu 


Asp 


Thr 


Gin 


Tyr 


Ser 


Lys 


Val 


Leu 


Ala 


Leu 


87 






50 










55 










60 










88 




Tyr 


Asn 


Gin 


His 


Asn 


Pro 


Gly 


Ala 


Ser 


Ala 


Ala 


Pro 


Cys 


Cys 


Val 


Pro 


89- 




65 










70 










75 










80 


90 




Gin 


Ala 


Leu 


Glu 


Pro 


Leu 


Pro 


He 


Val 


Tyr 


Tyr 


Val 


Gly 


Arg 


Lys 


Pro 


91 












85 










90 










95 




92 




Lys 


Val 


Glu 


Gin 


Leu 


Ser 


Asn 


Met 


He 


Val 


Arg 


Ser 


Cys 


Lys 


Cys 


Ser 


93 










100 










105 










110 






95 


(2) 


INFORMATION ] 


FOR , 


SEQ ; 


ID NO: 3: 





















48 



96 



144 



192 



240 



288 



336 



339 
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w- 



w- 



w--> 



w- 



w- 



w--> 



RAW SEQUENCE LISTING DATE: 02/12/2002 

PATENT APPLICATION: US/09/813 , 271B TIME: 08:59:40 

Input Set : N:\Crf3\02062002\I813271B.raw 
Output Set: N:\CRF3\02122002\I813271B.raw 



y D 


(i) SEQUENCE 


CHARACTERISTICS; 




















y / 




(A) 


LENGTH : 


339 


base pairs 


















98 




(B) 


TYPE: nucleic acid 




















y y 




(C) 


STRANDEDNESS : double 


















100 




(D) 


TOPOLOGY: 


linear 




















1 A 1 

101 


M i \ 

{ ±:L ) 


MOLECULE TYPE: cDNA 


to mRNA 


















i no 


( Vll ) 


IMMEDIATE SOURCE: 


























(B) 


CLONE : 


E . coli 


LC137/pPLMu , 


hTGF-beta2 i 


(DSM5657) 






1 A A 

104 


( IX ) 


FEATURE : 




























inc 
lUJ 




(A) 


NAME/KEY : CDS 






















i a a 




(B) 


LOCATION: 1 


. . 336 




















TAT 
10 / 




(D) 


OTHER INFORMATION :/product= 


= "human 


TGF - 


-beta2" 






i no 
lUo 


(XI ) 


SEQUENCE DESCRIPTION: SEQ ID NO: 


3 : 














-i A A 

109 




TTG 


GAT 


GCG 


GCC 


TAT 


TGC 


TTT 


AGA 


AAT 


GTG 




GAT 


AAT 


TCC 


TPP 


110 


Ala 


Leu 


Asp 


Ala 


Ala 


Tyr 


Cys 


Phe 


Arg 


Asn 


Val 


C^ n 


Asp 


Asn 


Cys 




ill 
111 






115 










120 










125 








112 


nmjv 
L1A 


CGT 


CCA 


CTT 


TAC 


ATT 


GAT 


TTC 


AAG 


ACC 


GAT 


TTA 


GGG 


TGG 


AAA 
AAA 


TCC 


113 


Leu 


Arg 


Pro 


Leu 


Tyr 


He 


Asp 


Phe 


Lys 


Arg 


Asp 


Ti^ii 


Glv 




Lys 


Trp 


114 




130 










135 










14ft 
14U 










TIC 

115 


Al A 


CAC 


GAA 


ccc 


AAA 


GGG 


TAC 


AAT 


GCC 


A AT 


mmp 
J. -L \_» 


TftT 


CCT 


GGA 


flPA 


Tnp 


116 


T 1 0 

-L J_e 


His 


Glu 


Pro 


Lys 


Gly 


Tyr 


Asn 


Ala 


Asn ■ 


Phe 




Ala 


Gly 


Ala 


Cys 


1 1 "7 

117 


lie 










150 










155 










i fin 


lib 




TAT 


TTA 


TGG 


AGT 


TCA 


GAC 


ACT 


CAG 




acc 


A r±C 


CTC 


CTC 


ACC 

nut 


TTA 


iiy 


Pro 


Tyr 


Leu 


Trp 


Ser 


Ser 


Asp 


Thr 


Gin 


His 


Ser 


A >*rr 


Val 


Leu 


Car 




120 










165 










1 70 










17 5 
J. / j 




1 oi 


TAT 


AAT 


ACC 


ATA 


AAT 


CCA 


GAA 


GCA 


TCT 


GCT 


TCT 


PPT 


TGC 


TGC 


CTC 


TPP 


122 


Tyr 


Asn 


Thr 


He 


Asn 


Pro 


Glu 


Ala 


Ser 


Ala 


Ser 


riu 


Cys 


Cys 


Val 


Ocl 


1 1 o 

123 








180 










185 










190 






lz4 


CAA 


GAT 


TTA 


GAA 


CCT 


CTA 


ACC 


ATT 


CTC 


TAC 


TAC 


ATT 


GGC 


AAA 


APA 


CCC 


IzD 


Gin 


Asp 


Leu 


Glu 


Pro 


Leu 


Thr 


lie 


Leu 


Tyr 


Tyr 


lie 


Gly 


Lys 


Thr 


Drn 


126 






195 










200 










205 








127 


AAG 


ATT 


GAA 


CAG 


CTT 


TCT 


AAT 


ATG 


ATT 


GTA 


AAG 


TCT 


TGC 


AAA 


TCC 


ACC 


12 o 


Lys 


He 


Glu 


Gin 


Leu 


Ser 


Asn 


Met 


He 


Val 


Lys 


Ser 


Cys 


Lys 




Cot* 


izy 




210 










215 










220 










130 


TAA 
































1 j 2 


(2) INFORMATION FOR SEQ 


ID NO: 4 




















TOO 

loo 


(i) 


SEQUENCE CHARACTERISTICS: 


















1 "5 A 
1 0 ft 




(A) 


LENGTH 


: 112 amino acids 
















135 




(B) 


TYPE: amino acid 




















136 




(D) 


TOPOLOGY : 


linear 




















137 


(ii) 


MOLECULE TYPE: ] 


protein 




















138 


(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO 


: 4: 














139 


Ala 


Leu 


Asp 


Ala 


Ala 


Tyr 


Cys 


•Phe 


Arg 


Asn 


Val 


Gin 


Asp 


Asn 


Cys 


Cys 


140 


1 








5 










10 










15 




141 


Leu 


Arg 


Pro 


Leu 


Tyr 


He 


Asp 


Phe 


Lys 


Arg 


Asp 


Leu 


Gly 


Trp 


Lys 


Trp 


142 








20 










25 










30 






143 


He 


His 


Glu 


Pro 


Lys 


Gly 


Tyr 


Asn 


Ala 


Asn 


Phe 


Cys 


Ala 


Gly Ala 


Cys 


144 






35 










40 










45 








145 


Pro 


Tyr 


Leu 


Trp 


Ser 


Ser 


Asp 


Thr 


Gin 


His 


Ser 


Arg 


Val 


Leu 


Ser 


Leu 



48 



96 



144 



192 



240 



288 



336 



339 
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146 50 55 60 

147 Tyr Asn Thr lie Asn Pro Glu Ala Ser Ala Ser Pro Cys Cys Val Ser 

148 65 70 75 80 

149 Gin Asp Leu Glu Pro Leu Thr lie Leu Tyr Tyr lie Gly Lys Thr Pro 

150 85 90 95 

151 Lys lie Glu Gin Leu Ser Asn Met lie Val Lys Ser Cys Lys Cys Ser 

152 100 105 110 

154 (2) INFORMATION FOR SEQ ID NO: 5: 

155 (i) SEQUENCE CHARACTERISTICS: 

156 (A) LENGTH: 339 base pairs 

157 (B) TYPE: nucleic acid 

158 (C) STRANDEDNESS : double 

159 (D) TOPOLOGY: linear 

160 (ii) MOLECULE TYPE: CDNA to mRNA 

161 (vii) IMMEDIATE SOURCE: 

162 (B) CLONE: E. coli LC137/pPLMu . hTGF-beta3 (DSM 5658) 

163 (ix) FEATURE: 

164 (A) NAME/KEY: CDS 

165 (B) LOCATION: 1. .336 

166 (D) OTHER INFORMATION : /product= "human TGF-beta3 M 





167 


(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 


: 5: 


















168 


GCT 


TTG 


GAC 


ACC 


AAT 


TAC 


TGC 


TTC 


CGC 


AAC 


TTG 


GAG 


GAG 


AAC 


TGC 


TGT 


48 




169 


Ala 


Leu 


Asp 


Thr 


Asn 


Tyr 


Cys 


Phe 


Arg 


Asn 


Leu 


Glu 


Glu 


Asn 


Cys 


Cys 




w--> 


170 






115 










120 










125 












171 


GTG 


CGC 


ccc 


CTC 


TAC 


ATT 


GAC 


TTC 


CGA 


CAG 


GAT 


CTG 


GGC 


TGG 


AAG 


TGG 


96 




172 


Val 


Arg 


Pro 


Leu 


Tyr 


He 


Asp 


Phe 


Arg 


Gin 


Asp 


Leu 


Gly 


Trp 


Lys 


Trp 




w--> 


173 




130 










135 










140 














174 


GTC 


CAT 


GAA 


CCT 


AAG 


GGC 


TAC 


TAT 


GCC 


AAC 


TTC 


TGC 


TCA 


GGC 


CCT 


TGC 


144 




175 


Val 


His 


Glu 


Pro 


Lys 


Gly 


Tyr 


Tyr 


Ala 


Asn 


Phe 


Cys 


Ser Gly 


Pro 


Cys 




w--> 


176 


145 










150 










155 










160 






177 


CCA 


TAC 


CTC 


CGC 


AGT 


GCA 


GAC 


ACA 


ACC 


CAC 


AGC 


ACG 


GTG 


CTG 


GGA 


CTG 


192 




178 


Pro 


Tyr 


Leu 


Arg 


Ser 


Ala 


Asp 


Thr 


Thr 


His 


Ser 


Thr 


Val 


Leu 


Gly 


Leu 




w--> 


179 










165 










170 










175 








180 


TAC 


AAC 


ACT 


CTG 


AAC 


CCT 


GAA 


GCA 


TCT 


GCC 


TCG 


CCT 


TGC 


TGC 


GTG 


CCC 


240 




181 


Tyr 


Asn 


Thr 


Leu 


Asn 


Pro 


Glu 


Ala 


Ser 


Ala 


Ser 


Pro 


Cys 


Cys 


Val 


Pro 




w--> 


182 








180 










185 










190 










183 


CAG 


GAC 


CTG 


GAG 


CCC 


CTG 


ACC 


ATC 


CTG 


TAC 


TAT 


GTT 


GGG 


AGG 


ACC 


CCC 


288 




184 


Gin 


Asp 


Leu 


Glu 


Pro 


Leu 


Thr 


He 


Leu 


Tyr 


Tyr Val 


Gly Arg 


Thr 


Pro 




w--> 


185 






195 










200 










205 












186 


AAA 


GTG 


GAG 


CAG 


CTC 


TCC 


AAC 


ATG 


GTG 


GTG 


AAG 


TCT 


TGT 


AAA 


TGT 


AGC 


336 




187 


Lys 


Val 


Glu 


Gin 


Leu 


Ser 


Asn 


Met 


Val 


Val 


Lys 


Ser 


Cys 


*Lys 


Cys 


Ser 




w--> 


188 




210 










215 










220 













189 TGA 339 

191 (2) INFORMATION FOR SEQ ID NO : 6: 

192 (i) SEQUENCE CHARACTERISTICS: 

193 (A) LENGTH: 112 amino acids 

194 (B) TYPE: amino acid 

195 (D) TOPOLOGY: linear 

196 (ii) MOLECULE TYPE: protein 
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1 Q7 
J. j / 




SEQUENCE DESCRIPTION: SEQ ID NO: 


> u • 














Ala 
nld 


Leu 


Asp 


Thr 


Asn 


i J i 


Cys 


Phe 


Arg 


Asn 


Leu 


VjXU 


CZ, 1 n a on 








1 
J. 








c 
D 


















1 S 




o n f\ 
zUU 


Val 


Arg 


Pro 


Leu 


Tyr 


He 


Asp 


Phe 


Arg 




Asp 


Leu 


\3±y irp 


Lys 


Trp 


201 








20 










25 








30 






202 


Val 


His 


Glu 


Pro 


Lys 


Gly 


Tyr 


Tyr 


Ala 


Asn 


Phe 


Cys 


Ser Gly 


Pro 


Cys 


203 






35 










40 










45 






204 


Pro 


Tyr 


Leu 


Arg 


Ser 


Ala 


Asp 


Thr 


Thr 


His 


Ser 


Thr 


Val Leu 


Gly 


Leu 


205 




50 










55 










60 








206 


Tyr 


Asn 


Thr 


Leu 


Asn 


Pro 


Glu 


Ala 


Ser 


Ala 


Ser 


Pro 


Cys Cys 


Val 


Pro 


207 


65 










70 










75 








80 


208 


Gin 


Asp 


Leu 


Glu 


Pro 


Leu 


Thr 


He 


Leu 


Tyr 


Tyr 


Val 


Gly Arg 


Thr 


Pro 


209 










85 










90 








95 




210 


Lys 


Val 


Glu 


Gin 


Leu 


Ser 


Asn 


Met 


Val 


Val 


Lys 


Ser 


Cys Lys 


Cys 


Ser 


211 








100 










105 








110 







213 (2) INFORMATION FOR SEQ ID NO : 7: 

214 (i) SEQUENCE CHARACTERISTICS: 

215 (A) LENGTH: 336 base pairs 

216 (B) TYPE: nucleic acid 

217 (C) STRANDEDNESS : double 

218 (D) TOPOLOGY: linear 

219 (ii) MOLECULE TYPE: other nucleic acid 

220 (A) DESCRIPTION: /desc = "recombinant hybrid DNA of 

221 (vii) IMMEDIATE SOURCE: 

222 (B) CLONE: E. coli LC137/pPLMu . TGF-betal ( 4 4/4 5 ) beta3 

223 (ix) FEATURE: 

224 (A) NAME/KEY: mat_peptide 

225 (B) LOCATION: 1. .132 

226 (D) OTHER INFORMATION : /product= "N-terminal 44 amino 

227 acids of human TGF-betal" 

228 (ix) FEATURE: 

229 (A) NAME/KEY: mat_peptide 

230 (B) LOCATION: 133 .. 336 

231 (D) OTHER INFORMATION : /product^ "C-terminal 68 amino 

232 acids of human TGF-beta3" 

233 (ix) FEATURE: 

234 (A) NAME/KEY: CDS 

235 (B) LOCATION :1. .336 

236 (D) OTHER INFORMATION : /product= "hybrid TGF-beta named 

237 TGF-betal-3" 

2 38 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



239 


GCC 


CTG 


GAC 


ACC 


AAC 


TAT 


TGC 


TTC 


AGC 


TCC 


ACG 


GAG 


AAG 


AAC 


TGC 


TGC 


48 


240 


Ala 


Leu 


Asp 


Thr 


Asn 


Tyr 


Cys 


Phe 


Ser 


Ser 


Thr 


Glu 


Lys 


Asn 


Cys 


Cys 




241 


1 








5 










10 










15 






242 


GTG 


CGG 


CAG 


CTG 


TAC 


ATT 


GAC 


TTC 


CGC 


AAG 


GAC 


CTC 


GGC 


TGG 


AAG 


TGG 


96 


243 


Val 


Arg 


Gin 


Leu 


Tyr 


He 


Asp 


Phe 


Arg 


Lys 


Asp 


Leu 


Gly 


Trp 


Lys 


Trp 




244 








20 










25 










30 








245 


ATC 


CAC 


GAG 


CCC 


AAG 


GGC 


TAC 


CAT 


GCC 


AAC 


TTC 


TGC 


TCA 


GGC 


CCT 


TGC 


144 


246 


He 


His 


Glu 


Pro 


Lys 


Gly 


Tyr 


His 


Ala 


Asn 


Phe 


Cys 


Ser 


Gly 


Pro 


Cys 
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